Unsupervised Audio Source Separation via Spectrum Energy Preserved Wasserstein Learning

نویسندگان

  • Ning Zhang
  • Junchi Yan
  • Yu Chen Zhou
چکیده

Separating audio mixtures into individual tracks has been a long standing challenging task. We introduce a novel unsupervised audio source separation approach based on deep adversarial learning. Specifically, our loss function adopts the Wasserstein distance which directly measures the distribution distance between the separated sources and the real sources for each individual source. Moreover, a global regularization term is added to fulfill the spectrum energy preservation property regarding separation. Unlike state-of-the-art unsupervised models which often involve deliberately devised constraints or careful model selection, our approach need little prior model specification on the data, and can be straightforwardly learned in an end-to-end fashion. We show that the proposed method performs competitively on public benchmark against state-of-the-art unsupervised methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised Learning based Modified C- ICA for Audio Source Separation in Blind Scenario

Separating audio sources from a convolutive mixture of signals from various independent sources is a very fascinating area in personal and professional context. The task of source separation becomes trickier when there is no idea about mixing environment and can be termed as blind audio source separation (BASS). Mixing scenario becomes more complicated when there is a difference between number ...

متن کامل

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...

متن کامل

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...

متن کامل

Blind Audio Source Separation in Time Domain using

Algorithms for Blind Audio Source Separation (BASS) in time domain can be categories as based on complete decomposition or based on complete decomposition. Partial decomposition of observation space leads to additional computational complexity and burden, to minimize resource requirement complete decomposition technique is preferred. In this script an optimized divergence based ICA technique is...

متن کامل

Deformed Statistics Free Energy Model for Source Separation using Unsupervised Learning

A generalized-statistics variational principle for source separation is formulated by recourse to Tsallis’ entropy subjected to the additive duality and employing constraints described by normal averages. The variational principle is amalgamated with Hopfield-like learning rules resulting in an unsupervised learning model. The update rules are formulated with the aid of q-deformed calculus. Num...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1711.04121  شماره 

صفحات  -

تاریخ انتشار 2017